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pL] Abstract 

^ Economic modeling in the presence of endogeneity is subject to model uncertainty 

at both the instrument and covariate level. We propose a Two-Stage Bayesian Model 
Averaging (2SBMA) methodology that extends the Two-Stage Least Squares (2SLS) 
' estimator. By constructing a Two-Stage Unit Information Prior in the endogenous 

^ variable model, we are able to efficiently combine established methods for addressing 

'i> model uncertainty in regression models with the classic technique of 2SLS. To assess 

^ the validity of instruments in the 2SBMA context, we develop Bayesian tests of the 

QQ identification restriction that are based on model averaged posterior predictive p- values. 

*0 A simulation study showed that 2SBMA has the ability to recover structure in both 

Cn| the instrument and covariate set, and substantially improves the sharpness of resulting 

coefficient estimates in comparison to 2SLS using the full specification in an automatic 
fashion. Due to the increased parsimony of the 2SBMA estimate, the Bayesian Sargan 
test had a power of 50 percent in detecting a violation of the exogeneity assumption, 
while the method based on 2SLS using the full specification had negligible power. We 
apply our approach to the problem of development accounting, and find support not 
only for institutions, but also for geography and integration as development determi- 
nants, once both model uncertainty and endogeneity have been jointly addressed. 
Keywords: Bayesian Model Averaging, Endogeneity, Instrumental variables. Posterior 
Predictive P- Values 
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1 Introduction 



In modeling economic interactions, applied economists are often faced with a surfeit of the- 



ories and a number of different variables that proxy a given theory. Raftery (1995) argued 
that uncertainty surrounding particular theories should be addressed explicitly by the sta- 
tistical approach. Uncertainty about a parameter can be underestimated when it is based 
on a single model and there is uncertainty about which model or theory to use. Bayesian 
model averaging (BMA) has been used extensively to account for model uncertainty in re- 
gression models, including in cross-country growth regressions and development accounting]^ 
However, these methods are constrained to modeling assocations between covariates and the 
dependent variable, ignoring issues of endogeneity and uncertainty surrounding instrument 
specifications. 

We propose a new methodology that combines the BMA approach to regression variable 
uncertainty and the Two-Stage Least Squares (2SLS) procedure. Our new method, Two- 
Stage BMA (2SBMA), enables the applied economist to address uncertainty in instrument 
and covariate specifications in a natural manner while simultaneously yielding an estimator 
that addresses endogeneity in the covariates. We propose this technique as an automatic 
alternative to the standard 2SLS exercises, which often involve extensive, and typically un- 
reported remodeling in order to obtain a reduced, relevant, covariate set. 

To date, instrument uncertainty has been addressed only in standard robustness analy- 
ses that juxtapose instruments associated with one particular theory/specification against 
another. In a prominent example in the development accounting literature, Rodrik et al. 
(2004), henceforth RST, motivate their work by a "horse race" among alternative theories 
that propose candidate instruments and regressors. 

Accounting for uncertainty about both covariates and instruments requires a method- 



ology that is rooted in statistical theory. Durlauf et al. (2007) introduced an instrument 
model selection procedure based on evaluating coefficient estimates according to t-statistics, 
but warned of the tenuous nature of the underlying statistical theory. The most comprehen- 
sive approach to addressing endogeneity in growth regressions has previously been proposed 



by Durlauf et al. (2008), who built on Tsangarides (2004). The authors introduced a model 
averaged version of 2SLS, but noted that their heuristic approach lacked statistical justifi- 
cation]^ Strictly speaking, Durlauf et al. (2008) did not allow for instrument uncertainty, 
but provided a model averaging approach to instrument candidate regressors in the second 



^See e.g., Fernandez et al. ( 200l| ), Sala-i-Martin et al. (2004), Ciccone and Jarocinski (2007), and Eicher 
et al. ( [2010| ). 

similar heuristic panel approach is introduced by Hineline (2007) to examine the growth/inflation 
relationship. Morales-Benito (2009) provides statistical foundations for a panel BMA approach. 
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stage only. 

The 2SBMA approach extends the method of Durlauf et al. (2008) and provides a statis- 
tical foundation for the procedure. An alternative to theory-specified instruments is the use 
of lagged dependent variables in a BMA-GMM approach (see Morales-Benito (2009)). We 
explore the properties of 2SBMA as a valid two-stage estimator and show that the procedure 
is a consistent methodology that reduces the well known many-instrument bias present in 
2SLS. A simulation study reported below shows a 45% reduction in many instrument bias, 
as well as a 46% reduction in mean squared error for estimating regression coefficients as 
opposed to 2SLS using the full specification. 

Instrumental variable estimation requires assumptions that relate to the identification of 



the implied structural model. Tests of the identification restrictions, such the Sargan (1958) 
and Cragg and Donald ( 1993 ) tests, compare a test statistic to a reference distribution. The 
test statistics are often asymptotically distributed according to the reference distribution, 
which frequently has a number of degrees of freedom that is related to the size of the model 
estimated. The nature of these statistics proves problematic in some economic applications, 
such as the study of growth and development determinants, where the sample size is small 
and additional regressors continue to be proposed. 

The 2SBMA approach provides a direct interpretation of the efficacy of an instrumen- 
tation strategy, by examining posterior inclusion probabilities. However, we also provide 
measures to verify instrument conditions based on Bayesian model averaged posterior pre- 
dictive p-values. We introduce these p-values as a simple extension of the work of Rubin 



(1984), Raftery (1988) and Gelman et al. (1996), who motivated the use of posterior predic- 
tive p-values for a single model. These p-values are then used to derive Bayesian versions of 
the Sargan and Cragg and Donald tests to test the identification restrictions. 

A simulation study quantifies the efficiency of 2SBMA as compared to 2SLS. We found 
that the Bayesian over-identification test had a power of 50% at detecting this failure, while 
the traditional Sargan test had a power of only 0.8%. We note that this results from the 
increased parsimony that the 2SBMA estimator obtains in comparison to the 2SLS estima- 
tor, highlighting the fact that 2SBMA is able to automatically introduce beneficial levels of 
parsimony in the modeling exercise. 

Finally, we apply 2SBMA to a prominent approach to development accounting that is 
known to be subject to substantial instrument and determinant uncertainty]^ We use RST's 



■^We use the term development accounting in the broad sense, referring to studies that seek to examine 
the determinants of differences in levels of per capita income. Previous development accounting approaches 
differ in their emphases, such as physical capital (King and Levine (1994|), human capital (Klenow et al. 
p97| ), as weU as TFP (CaseUi ( [2005t ). In our application we focus only on studies that sought to explain to 
differences in per capita incomes based on integration, institutions, and geography since the above approaches 
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own data and the variables motivated by their empirical approach to span the model space 
for our 2SBMA approach. RST found strong evidence for the "primacy of institutions" over 
all other alternative theories after conducting an elaborate "horse race" among all alternative 
theories and their associated candidate regressors. The 2SBMA results suggest a qualifica- 
tion of RST's strong conclusions. We find that not only institutions, but also integration and 
geography explain differences in per capita incomes across countries. This suggests that pre- 
viously reported robustness results may have depended largely on which specifications were 
used. In addition we find that a number of instruments that are associated with alternative 
theories also receive support from the data, once instrument and determinant uncertainty 
have been addressed simultaneously. These results are consistent with the findings of Durlauf 



et al. (2008) that many growth theories/variables are not robust once model uncertainty is 
taken into account. 

Through his work on Bayesian regression and time series, Arnold Zellner had consider- 
able influence on the material presented here. In particular, one of the main themes of this 
work is proposing a proper prior distribution on the regression parameters. The g-prior of 



Zellner (1986) is often used for model selection exercises in the regression context because 
it is a proper prior distribution. In the model we consider below, we extend the Unit In- 
formation Prior (Kass and Wasserman, 1995), which can be seen as a specific version of 
Zellner's g-prior. Arnold Zellner was also an early contributor to the literature on Bayesian 



combination of forecasts and models (Palm and Zellner, 1992 Min and Zellner, 1993), which 
underlies much of the present work. 

The article proceeds as follows. Section 2 outlines the statistical foundations of the 
2SBMA approach. Section 3 introduces Bayesian tests of the identification restrictions and 
Section 4 describes the efficiency of 2SBMA and the power of Bayesian tests in simulation 
studies. Section 5 applies 2SBMA to a prominent approach to development accounting to 
highlight the importance of both determinant and instrument uncertainty. Section 6 con- 
cludes. 



2 Statistical Foundations of 2SBMA 

This section develops the theoretical foundations for two-stage Bayesian Model Averaging 
(2SBMA). First, we sketch the properties of the 2SLS estimator that we extend. Then we 
formulate the endogeneity problem to express first and second stage likelihoods in a model 

do not feature instrument uncertainty. 
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averaging context. After developing a two-stage prior that allows for an efficient approxima- 
tion of the model probabilities, we then derive the complete 2SBMA methodology. 



2.1 Marginal and Conditional Likelihoods for Two-Stage Appli- 
cations 

A standard approach to addressing endogeneity is to apply 2SLS and impose identification 
restrictions. We consider the model, 

Y = {WX)l3 + r,, (1) 
W={ZX)e + e, (2) 

where Y is the n x 1 dependent variable, X is an nx set of covariates, W is the set of 
endogenous variables and Z is the n x pz set of instruments. The notation {A B) denotes a 
matrix placing A and B together, provided they have the same number of rows. To simplify 
exposition we assume that W is n x 1. Assuming that 



the classical endogenous variable situation arises when 7^ 0, causing W to violate the 
standard regression assumption of independence of the error term, t]. 

Fundamental to the 2SBMA methodology is the specification of two-stage marginal and 
conditional likelihoods. Letting U = (W X) and V = {Z X), we adopt the notation of 
Kleibergen and Zivot (2003), and rewrite Equations [l] and [2] as 



Y = Ui9)^ + u, (1') 
W = Ve + e, (2') 

where U{6) = {V6 X) is the replacement of W in U with its fitted value, and the OLS 
estimate regressing W on V is given by f/ = U{6) where 6 = (y'V)~^V'W . Since u = Pe + rj, 
we have 



Kleibergen and Zivot (2003) reparameterize ([r| and (2') using = c<;22^W2i to yield 



Y = U{9)f3 + ^ + e(P, (1") 
W = Ve + e, (2") 
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where var{^) = i>J\\.2 = uju — 
In the presence of endogeneity, it is well known that the correlation between u and e 
leads to inconsistent estimates of the entire coefficient vector, /3, under standard Ordinary 
Least Squares (OLS). The 2SLS estimator, 



may resolve the inconsistency problem when instruments, Z, exist that are independent 
of Y, given W and the vector of covariates, X. The model is identified only when these 
conditional independence assumptions are valid, resulting in a consistent estimator. The 
2SLS estimate of /3 is well known to be biased, and the extent of this bias increases with 
the number of terms that are added in the first stage with coefficients equal, or close to, 
zero. Under such conditions, 2SLS estimates may not reduce the bias of the OLS results 



(Davidson and MacKinnon, 2004). It is therefore of practical interest to develop a 2SBMA 
methodology that quantifies the degree to which each proposed covariate and instrument 
enters into the equations above. 



The reparameterization of (1") and (2") proves useful to derive the two-stage marginal 
and conditional likelihoods. 

Theorem 1. The two-stage marginal and conditional likelihoods are given by 

L{/3\(f),uJu-2,Y,e) = u^^.^ exp , (4) 

y ^ 1^11-2 J 

where e = W — V6. The marginal likelihood is maximized at 6 = {V'V)~^V'W , while the 
conditional likelihood is maximized at (3'^^^^ . 

Proof See Appendix. 



2.2 Constrained Two-Stage Likelihoods 

When juxtaposing competing theories using 2SBMA, some coefficients in either 6 or P are 
constrained to be zero. As a consequence, these members of either X ot Z need not be 
included in estimation of the associated parameter. This section derives the constrained 
marginal and conditional likelihoods for this case. 

Let Ai he a. collection of individual first stage models, {Mi, . . . ,M/}. Associated with 
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each first stage model Mj is a submatrix V^^*^ of the matrix V. When the variables Zi and 
are in Mj, then these columns are retained in V^^\ while those not in Mj are excluded. 
For Mi, the least squares estimate is 9^'^ = (V^(^)V(^))"^ V'-'yW. 

Similarly, let £ be a set of individual second stage models, {Li, . . . ,Lj}. Each model 
Lj again corresponds to a submatrix U^^^ of the matrix U. For W G Lj, U^^\6'^'^'>) denotes 
the replacement of W in U^^^ with the fitted value V"'-*^^*^*-', and when W ^ Lj, we write 
f/(i)(^(»)) = f/(i). To ease notation, we write f/(^'^) for U^^\e^^^). 



Analogous to the representation in Equations 1" and |2"[ the sampling model under M, 



and Lj takes the form 



Vr = VW^« + eW, (6) 



where var{e^'^^) = 0022, var{^'-^'^^) = uii{''2- The constrained 2SLS estimator based on models 



Mi and Lj is then calculated as 



Theorem 2 highlights how the marginal and conditional likelihoods are affected when a 
variable contained in U^^^ is excluded from from V^''\ 

Theorem 2. Under models Mi and Lj the expressions in Equations ^ and ^ yield the 
following marginal and conditional likelihoods: 



L(e\W,V,u^^) = (o 



uj'22 exp 



2 co^} 



22 



(/3|0(^'^),c.i\'5,F,^) = (a;!l'S)"'^ exp 



n/2 / i(y _;7OO/3OO+eW0(MO)'(y_;7(i)/3(i) + e«0(M)) 



2 



11-2 



(9) 

where e^*^ = W — V^''^9^'^\ Equation zs maximized at 6^'^\ while Equation is maximized 
at 

^(M) + 0{i,i)n{M)^ (10) 

where fl^*'^') = (f/(j')'?7(^'))-i?7(^')'e». 
Proof See appendix. 

Theorem [2] implies that the mode of the conditional likelihood for (3 equals plus 
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the term 0(*'-^^n(*'-'^. To interpret this difference we note that fl^*'-?) represents the regression 
coefficients of f/^-'-' on the first stage residuals. Hence for the case where all covariates in Lj 
are contained in Mi, n^*'-') = and Equation (10) becomes For the case where variable 

Xi is in Lj but excluded from Mj, and where Xi has little explanatory power for W given 
the remaining elements of Mj, fl^*'-^) is, by definition, negligible. The 2SBMA assumptions 
outlined below that govern the model pair (Mi, Lj) constrain n^*'-?) to zero. 

The 2SBMA methodology is based on calculating the quantities in Equation ([T]). As 
opposed to Limited Information Maximum Likelihood (LIML), the focus of our estimator 
is on maximizing the marginal, then the conditional likelihoods, yielding estimators of the 
form while LIML maximizes the joint likelihood of the parameters. This causes our 

methodology to be more similar to 2SLS than LIML. Indeed, note that when Mj and Lj are 
the full models for the first and second stage, respectively, = (3^^^^ . Furthermore, if 
Xi is the only variable excluded from Mi and Lj, is simply the 2SLS estimate with this 
covariate excluded. Similarly, if Zm is the only quantity excluded from Mi, and Lj is the full 
model, is the 2SLS estimate with instrument Z^, excluded. 

The formulation above also allows for two occurrences not considered in standard 2SLS. 
First, we may have variable Xi G Mi, but Xi ^ Lj. This is equivalent to stating that a 
variable originally considered to be a covariate is now considered conditionally independent 
of Y, given the remaining variables in Lj. However, the variable still has some explanatory 
power for W, which causes it to be included in Mj. Second, we may include variable Xi G Lj, 
but exclude it from Mi. This case is not considered in standard 2SLS analysis, as it is well 
known to result in bias if the effect of Xi on W is substantial, as outlined by the term H^*'-') 
in Theorem [2] above. In 2SBMA we do allow for these occurrences, since a large H^*'-'^ would 
imply a failure in the model selection methodology during the first stage. 



2.3 The Two-Stage Unit Information Prior 

2SBMA requires a two-stage prior distribution over the parameters of the model. In the 
context of Bayesian Instrumental Variable estimation, Dreze (1976) suggested the improper 
prior 

pr(/3,^,S)oc|Sr^/^ 

for Equations ([T]) and ([2]). This two-stage prior is in part motivated by the standard improper 
prior pr{P, a) oc l/cx placed on Bayesian regression problems. 

Kleibergen and Zivot (2003) note that the Dreze prior may be poorly behaved as — ?■ 0. 
Instead they suggest 



pr{l3,e,(P,uJn.2,co22) oc co^^^ |^]^'/' \9'V'V9\'^^ 



'IV. 
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a prior that includes dependence on the first stage coefficients, 6. Kleibergen and Zivot 



(2003) note that the posterior from this prior bears a number of similarities to the 2SLS 
estimator. In particular, the conditional posterior of /3 when 9 = 9 has mean and mode 
equal to fi'^^^^ . 

While the Kleibergen and Zivot prior has a number of desirable properties, it is still 
improper in several parameters. This may not have undue influence on standard Bayesian 



posterior parameter estimation. Kass and Raftery (1995) note, however, that the use of 
improper priors has a greater impact in the context of model comparison. Hence we seek 
to develop a two-stage prior which shares many of the properties of the Kleibergen and 



Zivot (2003) prior, but is proper. We choose to base our analysis on the Unit Information 



Prior (UIP), which has been motivated in the context of BMA (Kass and Wasserman, 1995 



Raftery, 1995). The UIP is a normal prior with a mean centered at the maximum likelihood 
estimate and a variance equal to the inverse of the average information contained in one 
observation. 

We construct a prior of the form 

pr{(3, 9, un-2, ^22, 0) = pr{l3, un-2, 4>\9, uj22)pr{9, U22), 

and specify a standard UIP on pr{9,uj22), based on the likelihood in Equation ([s]). In the 
second stage, we construct a prior that is conditional on the first stage, pr{(3, c<;ii.2, 016*, U22), 
and specify a UIP for given values of (9,U22), based on the likelihood in Equation ([9]). By 
Theorem [l| the resulting two-stage UIP centers (3 at ^'^^^^ when 9 is set to its mode, 9. 

A prior over parameters of restricted models, (Mj, Lj), is specified in essentially the same 
manner 

pr (/3(^),^«,u;ft5,4?,</>(^'^)) =pr (/3(^), ^'J, eg) pr , 

where a UIP is again specified on pr{9'^'^\uj 22)- Then, conditional on a value of 9'^'^\ we specify 
a UIP on (5^^\ 

By imposing the conditional independence assumption on the pair of models {Mi,Lj), 
we can set n^^'-') = 0. Therefore, when 9^'^ = 9^'\ the prior distribution for (3^^^ is centered 
about i3^'^'^\ This implies that the two-stage UIP can be considered a proper analogue of the 



Kleibergen and Zivot (2003) prior. That is, the prior features a built-in dependence on 9, 



while its posterior mimics the 2SLS estimator. 



2.4 Statistical Foundations of 2SBMA 



2SBMA combines the 2SLS methodology discussed above with the standard BMA method- 
ology reviewed below. 2SBMA processes the data much like a two-stage estimator while 
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addressing model uncertainty in both stages. The first stage is a simple apphcation of BMA 
to identify effective instruments. It is helpful to review the properties of BMA that are 
implied in stage 1. 

Let A be a quantity of interest. In BMA, the posterior distribution of A given the 
data, D, is given by the weighted average of the predictive distributions under each model, 
weighted by the corresponding posterior probabilities, 

pr(A|L>) = J]pr(A|M,,L>)pr(M,|L>), 
1=1 

where pr(A|Mj, D) is the predictive distribution given model Mj and pr{Mi\D) is the poste- 
rior model probability of model Mj. The posterior model probability tt^, for each first stage 
model Mi is given by 

TTi = pr{Mi\D) 

oc pr(D\Mi)pr(Mi), 

where 

pr{D\Mi) = J pr{D\9^'\Mi)pr{9^^\Mi)d9^^ 

is the integrated likelihood of model Mj with parameters 9^'^\ The prior densities for param- 
eters and models are pr(^^*)|Mj) and pr{Mi), respectively. 

Under BMA, the posterior mean of 9 is the sum of the posterior means of each model in 
the collection M., weighted by their posterior probabihties 

i=l 

Similarly, the posterior variance is 

1=1 i=i 

The posterior variance highlights how BMA methodology accounts for model uncertainty. 
The first term is the weighted variance for each model, a'^ — Var{9^^''\Mi, D), averaged 
over all relevant models, and the second term indicates how stable the estimates are across 
models. The more the estimates differ between models, the greater is the posterior variance. 

The posterior distribution for a parameter is a mixture of a regular posterior distribution 
and a point mass at zero, which represents the probability that the parameter equals zero. 
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The sum of the posterior probabihties of the models that contain the variable is called the 
inclusion probability. For instance, for instrument we may write, 



where A^^ is a collection of indices for which i & M.f^ implies model Mj does not restrict the 
parameter Oz^, to zero. Standard rules of thumb for interpreting fi^^^^ have been provided by 



Kass and Raftery (1995), modifying an earlier proposal of Jeffreys (1961). They suggested 
the following effect thresholds: <50%: evidence against the effect, 50-75%: weak evidence 
for the effect, 75-95%: positive evidence, 95-99%: strong evidence, and >99%: very strong 
evidenc^ 

In the case of 2SLS estimation in the presence of model uncertainty, the BMA framework 
must be extended to account for model uncertainty at both stages. For models Mj and Lj, 
we are now interested in 

pr{Mi,L,\D) = pr{Lj\M,,D)pr{Mi\D). (12) 



The decomposition in Equation ( 12 ) indicates the dependence that the probability of model 
Lj has on the particular choice of model Mj, as each model Mi yields a slightly different 
distribution for the fitted value of W. Furthermore, for a given value of 9^'^\ the quantity 
pr{Lj\9^'\Mi,D) is itself a regression problem with a particular UIP. 

Denoting by the probability of model Lj given model Mj yields the 2SBMA estimator 

^2SBMA^J2J2^.U^J^-^\ (13) 
i=l j=l 



Equation (13) shows that the 2SBMA estimate is formed as the average of each constrained 
2SLS estimate that results from the combination of model M, in the first stage, and model 
Lj in the second stage, weighted by both the first and second stage model probabilities. To 



calculate the posterior variance of p^^^'^'^^ we have the following result. 

Theorem 3. Let = X]j=i ^he model averaged estimate of (3 for a given first 

stage model Mi . Then the variance of the estimate ^^sbma 

i=l i=l 



*From a decision theory perspective, one can imagine cases where a pohcy-maker might be interested in 



a variable even when its posterior inchision probabihty is below 50% ; see Brock and Durlauf (20011 and 



Brock et al. (20031 
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where 

J 2 

i=i i=i 
is the BMA variance associated with second stage estimates for a fixed first stage model. 

Proof See Appendix. 

Theorem [3] shows that the variance of 2SBMA estimates can be decomposed into two 
parts that yield interpretations similar to standard BMA variances. The first term is the 
average of BMA variances associated with the models in the first stage, and the second term 
represents the variation of a given first stage model's BMA estimates relative to the overall 
2SBMA estimate. 

The posterior distribution of yj^ssMA jg again a mixture of a regular posterior distribution 
and a point mass at zero, which represents the probability that the parameter equals zero. 
The sum of these posterior probabilities that contain the variable is then the inclusion 
probability at the second stage. For instance, for the variable Xi we have 

^2SBMA^p^^^ = prCPt^'^^ ^m = Y.Y. ^^^^^^ 

where Ci is the subset of C for which the coefficient [5xi is not constrained to zero. We 
continue to follow the standard rules of thumb for interpreting effect thresholds in the second 
stage [ARl]. 

A desirable feature of the 2SBMA estimator is that asymptotically it will resemble the 
2SLS estimator. This is due to the fact that any proposed instrument or covariate that has 
a zero coefficient in either the first of second stage will be dropped in the limit by the model 
selection procedure. This yields the following result. 

Theorem 4. The 2SBMA estimate is consistent under identification in the full model. 

P2SBMA P ^f^^^ ^2SLS ^^P^ 

Proof See Appendix. This result mainly implies that one does not sacrifice consistency 
when employing 2SBMA rather than 2SLS. 

We further note that coefficients derived from instrumental variables regressions exhibit 



increasing finite sample bias as the number of instruments increases (see, e.g.. Hall (2005)). 



The bias is aggravated when the proposed instruments have little explanatory power for the 



endogenous variable (Bound et al. (1995)). While it is difficult to assess in general, the 
2SBMA methodology has the potential to limit this finite sample bias, by dropping unnec- 
essary terms. The simulation study in Section 4 shows an instance in which this occurs. 
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3 Bayesian Tests of Identification Restrictions 



Various tests have been developed to examine the instruments' conditional independence (on 
Y) and explanatory power (for W). Below we develop Bayesian equivalents of the Sargan 
( 1958 ) test of over-identification and the Cragg and Donald ( 1993 ) test of under-identification 
that Stock and Yogo (2005) used to propose a weak instruments test. We focus on these tests 
as they are widely in use by the applied community and have proven robust at detecting 
violations of the identification assumptions. Further, as shown in the simulation study of 
Section |4| they perform well in our context. We then show how model averaged versions 
of these tests can be used in the 2SBMA framework to verify model assumptions, and we 
discuss the properties of such techniques. 

To develop Bayesian model averaged versions of the Sargan test and the Cragg and 
Donald test we rely on the notion of posterior predictive p- values. Rubin (1984) argued that 
posterior predictive p-values are useful tools in applied Bayesian statistics to verify model 
assumptions. Given a hypothesis, H, about the model, the posterior predictive p-value given 
data is calculated as pr{H\D). It quantifies the degree to which the hypothesis is supported 
by the posterior distribution of the model parameters given the data D. 

In the case of 2SBMA, when the hypothesis relates to the model identification, we consider 
the following decomposition for model averaged posterior p-values: 

pr{H\D) = Y.Y1 T^^^^,MH\Mi, L„ D). (15) 
i=i j=i 



The decomposition in Equation (15) outlines the motivation for model averaged posterior 
p-values. Since the hypotheses of primary interest are related to the identification of the 
2SBMA estimate, it is natural to ask whether each sub-model which is used to form the 
2SBMA estimate is appropriately identified. Naturally, different specifications yield varying 
degrees of confidence as to whether the identification requirements are fulfilled and it is ap- 
propriate to weight posterior predictive p-values for each model by the extent to which each 
model contributes to the 2SBMA estimate. 



3.1 A Bayesian Test of the Over-Identification Restrictions 

By selecting first-stage models based on the integrated likelihood, we are explicitly choosing 
combinations of instruments based on model fit. Hall et al. (1996) note that any procedure 
that chooses the first stage based on goodness of fit measures may risk including endogenous 
variables. Therefore, it is important to develop a diagnostic that helps assess whether the 
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model selection procedure has produced such an over-identification failure. 

We base our test of over-identification on the Sargan ( 1958 ) test. Let r/*^*'-'-' be the residuals 
from the combination of models Mj and Lj, and let pij be the total number of X and Z 
included in this combination. The Sargan p- value S* is calculated as S* = prinR^^ > 
Xpx+pz-i^ where R^^ is the associated with the regression of ff'^^^ on all X and Z 
variables. 

Just as in the classical Sargan test, we consider the regression of j)*^*'-'-' on the subset of 
the variables X and Z that belong to either Mj or Lj, and determine i?^^-, which is the 
posterior mean R^ associated with this model speficiation. Letting S^^'^^ = piXpi^^i > 
nRfj\X, Y, Z, W), where we define the Bayesian Sargan p- value to be 

i=i j=i 

^2SBMA -g therefore the average of the Sargan p-values derived from the specific models Mj 
and Lj, weighted by their respective posterior probabilities. 

The benefit of the Bayesian 2SBMA Sargan test is that the parsimony of 2SBMA ef- 
fectively mitigates the reduction in power that the traditional Sargan test experiences as 
the dimensions of the X or Z variables grow. This increase in power can be big, as shown 
in the simulation study below. It is important to note that this increase in power derives 
from the fact that the 2SBMA estimator contains models with greater parsimony than the 
full specification as opposed to a fundamental difference between Bayesian and frequentist 
approaches. 



3.2 Bayesian Tests of Under-Identification and Weak Instruments 

While it is crucial to verify that proposed instruments do not violate the conditional inde- 
pendence assumption, it is also important to test that they have explanatory power for the 
endogenous W. When W is univariate, this may be done by considering an F test based on 



the first stage. However, when pw > 1, Cragg and Donald (1993) derived an equivalent test 
for this. Here we derive a Bayesian analog of this test. 

Consider fixed first and second stage models, Mj and Lj respectively. Let Zij be the 
instruments used in this combination, namely all those variables in Z used in Mj and 
those variables X used in Mi but excluded from Lj. Let Xj be those X contained in 
Lj, and let Vij be the matrix of all X and Z variables included in either Mj or Lj. Define 
Py^j = yij(yijVij)~^Vij and My^^, = In—Pvij where /„ is the nxn identity matrix, and similarly 
define Px, = X,{X'jXj)-^X'j and Mx, = /„ - Px,, and finally define dj = t-^^^'^Qi^t^^'^ 
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where = W'My^W and 6,, = (M^, W^)'Mx,%((Mx,%)'Mx,%)-HMx,%)'Mx, W^. 
The Cragg and Donald statistic under model Mj and Lj can then be derived as the mini- 
mum eigenvalue of Gij, gij = mineigenGjj. 

In practice, the statistic Qij is used in two ways. Asymptotically, under the null hypoth- 
esis of under-identification, raoj,- ~ _i, and this reference distribution is used to derive 
a posterior predictive p-value. Here we propose a Bayesian model-averaged version of this 
posterior p-value by considering 

/ J 

CD = ^^7riUidPr{xlz^^-i > ngij). 
i=i j=i 



A second use of gij was suggested by Stock and Yogo (2005), but their test statistic 
provides only critical values, not the necessary p-values that can be averaged when models 
have different numbers of instruments. The Bayesian approach can also be used to assess an 
apparent weakness of an instrument. The approach is simple and direct: it simply requires 
an examination of the instruments' inclusion probabilities. 



4 Simulation Study 



We conduct a simulation study to quantify the estimation properties of 2SBMA and the 
behavior of the Bayesian tests of the identification restrictions. 

In the following we consider a framework in which there are ten variables in Z, 15 in X 
and W is univariate. Our construction of the variables in X and Z is similar to the simulation 



study in Fernandez et al. (2001), which was based in turn on Raftery et al. (1997). 

For constructing X we let (Xi . . . Xio) be an x 10 matrix of independent draws from 
a N{0, 1) distribution. We set 

(Xn . . . Xis) = (Xi ... X5)(.3 .5 .7 .9 1.1/(1 ... 1) + E, (16) 



where is an n x 5 matrix of draws from the X(0, 1) distribution. Note that Equation (16) 
induces a correlation between the first five regressors and the last five regressors. It takes 
the form of small to moderate correlations between the first five variables in X and the last 
five where the the theoretical correlation coefficients increase from 0.153 to 0.561. 

The matrix Z is sampled in a similar manner. [Zi . . . Z^ is an x 5 matrix of standard 
normal variates and we then set 

(Ze . . . Zio) = (Zi ... Z5)(.3 .5 .7 .9 1.1)'(1 ... 1) + F, 
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where F is an n x 5 matrix of independent draws from the A^(0, 1) distribution. Finally, we 
construct the model 

Y = W + 1.8X1 + 1.5X2 + Xn - 1.5X12 + 2r7, (17) 
W = 1.1X1 - .5X3 + .75X12 + .75Zi - 2Zs + 3e. (18) 

Thus, we consider a situation in which four covariates along with W have explanatory 
power for Y. Furthermore, two variables in Z serve as instruments and two of the variables 
in X have explanatory power on both Y and W. Finally, one variable in X (X3) would be 
more properly classified as an instrument, as it has explanatory power on W but not on Y. 

We introduce endogeneity by drawing e from a X(0, 1) distribution and setting rj = e + C,, 
with ^ drawn from a X(0, 1) distribution as well. We then consider two scenarios. The first 
scenario is one in which the IV model is correctly specified, i.e. the Z covariates have no 
effect on Y. In the second scenario we consider a misspecified model in which t] = Zi+e + C,, 
so that the instrument condition fails. In each scenario we simulate datasets of 100 obser- 
vations and consider 500 replicates. The simulation study is structured to roughly resemble 
the data set we will be examining below. 

Figure [T] shows the distribution of the estimate of Pw across replications using 2SBMA, 
2SLS and OLS. We see that the OLS estimates are centered about a value of 1.3. Indeed, 
in this case the OLS estimate will asymptotically approach this value. Both 2SBMA and 
2SLS rectify this bias and are more closely centered about the true value of 1. However, the 
average bias and average mean squared error of 2SBMA are about 45% lower than those of 
2SLS, so 2SBMA performs much better than OLS or 2SLS. 

The first panel in Figure |2] shows the distribution of the p-values returned from the 
Bayesian Sargan test as well as the traditional Sargan test. We see that the ]?- values from 
the Bayesian Sargan test are much higher. However, these scores are still sufficiently low 
that the exogeneity assumption is unlikely to be incorrectly rejected. 

The second panel in Figure [2] shows the resulting Bayesian Sargan and classical Sargan 
p-values for the case of a misspecified exogeneity assumption. In the case of valid instru- 
ments, the size of both tests was 0. However, in the case of invalid instruments the power 
of the Bayesian Sargan test was 50%, whereas it was 0.8% using the traditional Sargan test, 
based on a = .05. We see that the Bayesian Sargan test is more precise in discerning the 
failure of the exogeneity assumption and is far more likely to reject the hypothesis that the 
IV assumptions are valid than the classical Sargan test. As noted above, this result derives 
from the fact that the 2SBMA estimates are formed over models with considerably greater 
parsimony than the full specification, which highlights the benefits of parsimonious models 
and the automatic manner in which 2SBMA is able to propose these sparse models. For 
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Figure 1: Finite Sample Bias under 2SBMA, 2SLS and OLS. Distribution of the estimate of the 
coefficient f3\Y across rephcations using 2SBMA, 2SLS and OLS, when P]y = 1. The average bias 
of across 500 rephcations was .019, .035 and .310 for 2SBMA, 2SLS and OLS respectively. The 
average mean squared error for estimating the entire vector /3 was .213, .395 and .840 for 2SBMA, 
2SLS and OLS respectively. 




the case of valid instruments, both the classical and Bayesian Cragg-Donald tests correctly 
rejected the null hypothesis of no identification for all repetitions. 

The previous figures show that 2SBMA returns appropriate coefficient estimates and 
yields improved power at detecting assumption violations over traditional methods. Table [T] 
shows that the technique also uncovers the pattern of interaction in both stages of the estima- 
tion. When the model is correctly specified. Table [T] shows the median inclusion probability 
for each variable across the 500 replications in both stages as well as the interquartile range 
of the inclusion probabilities. We see that in the first stage the two variables in Z as well as 
the three variables in X are given high inclusion probabilities, while the remaining variables 
are generally excluded. This remains true in the second stage as well, where W and the four 
covariates in X that have explanatory power are given large inclusion probabilities and all 
others are given negligible inclusion probabilities. 
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Figure 2: Distribution of p- values returned by the Bayesian Sargan test and the Sargan test across 
rephcations when the IV assumptions hold (left) and when they do not (right [AR2]). In all cases, 
a nominal level of 5% was used, implying that the assumption of valid instruments would be 
rejected when the p-value was above .95, shown by the dotted line in each plot. In the case of 
valid instruments, the size of both tests was (no false rejections). However, in the case of invalid 
instruments the power of the Bayesian Sargan test was 50% (half the cases were correctly rejected), 
whereas it was 0.8% using the traditional Sargan test (4 cases out of 500 were correctly rejected). 



Valid Instruments 



Invalid Instruments 




Table 1: Median and interquartile range (IQR) of variable inclusion probabilities across 500 repeti- 
tions. Variables shown in bold are those that are included in either the first or second stage. This 
table shows that inclusion probabilities closely match the true structure of the system. 



First Stage Second Stage 

Variable Median IQR Median IQR 



w 






1 


(1,1) 




1 


(1,1) 


0.761 


(0.573,0.999) 


X2 


0.017 


(0,0.064) 


0.688 


(0.222,0.962) 


Xs 


0.561 


(0.167,0.943) 


0.065 


(0.018,0.064) 


Xi 


0.031 


(0,0.081) 


0.092 


(0.023,0.096) 


^5 


0.031 


(0,0.102) 


0.09 


(0.024,0.086) 


^6 


0.019 


(0,0.082) 


0.091 


(0.016,0.094) 


Xi 


0.014 


(0,0.067) 


0.086 


(0.014,0.083) 


Xs 


0.026 


(0,0.084) 


0.077 


(0.018,0.07) 


Xg 


0.018 


(0,0.077) 


0.077 


(0.016,0.088) 


Xio 


0.014 


(0,0.074) 


0.072 


(0.014,0.065) 


Xii 


0.021 


(0,0.088) 


0.473 


(0.1,0.874) 


Xl2 


1 


(1,1) 


0.72 


(0.492,0.987) 


Xi3 


0.016 


(0,0.076) 


0.094 


(0.023,0.08) 


Xi4 


0.02 


((),(). 096) 


0.107 


(0.027,0.109) 


Xi5 


0.022 


((),(). 096) 


0.112 


(0.027,0.117) 


Zi 


0.908 


(0.963,1) 






Zi 


0.074 


(0,0.066) 






Z3 


0.079 


(0,0.063) 






Zi 


0.077 


(0,0.068) 






^5 


0.082 


(0,0.073) 






z& 


0.079 


((),(). 066) 






Z-, 


0.083 


(().(). 066) 






z» 


1 


(1,1) 






Zg 


0.075 


(0,0.063) 






Zio 


0.1 


(0,0.089) 
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5 Instrument and Determinant Uncertainty in Devel- 
opment Accounting 

We now apply 2SBMA to one of the most prominent approaches to institutions in the de- 
velopment accounting literature. The literature seeks to explain the variation in the level 
of per capita income in 1995 with alternative theories of economic development. Rodrik et 



al. (2004) (RST) juxtapose the most prominent development theories and their associated 
candidate regressors in one comprehensive approach. They conducted what they explicitly 
called a "horse race" among theories that pertain to development determinants (geography, 
integration/trade and institutions). Endogeneity is a major feature of this literature, since 
one could also argue that institutions or integration/trade are determined by high levels of 
per capita income. Hence the development accounting literature has given rise to a large 
set of alternative theories and their alternative instruments that can be used to identify 
the effect of institutions and trade on development to resolve the endogeneity issues. This 
implies that model uncertainty is present in both the development determinant and in the 
instrument stage. With less than 100 observations, the RST sample is a standard size of 
datasets in development accounting. 

RST explored over 25 different robustness specifications with different candidate regres- 
sors that are suggested by a comprehensive set of theories that span the literature. Their 
results are so uniform and decisive across all specifications that their claim to have resolved 
model uncertainty is well captured by their title: "Institutions rule: the primacy of institu- 
tions over geography and integration in economic development." At best, RST find geography 
may have weak direct effects while Integration is found to be "always insignificant, and of- 
ten enters the income equation with the 'wrong' sign." Their results are, however, in stark 
contrast to previous evidence of Trade/Integration, and Geography effects on development 



e.g.. Hall and Jones ( |1999D , Sachs ( |2003D ); but these papers were discounted because they 



did not provide RST's level of robustness analysis. 



^Our computations use the bicreg function from the BMA R package (Raftery et aL (2005)), since the 



Bayesian Information Criteria (BIC) closely approximates the posterior model probability under the UIP 



(Kass and Wasserman, 1995 Raftery, 1995). This allows for BIC approximations of the integrated likelihood 
in both the first and second stages. For a given value of 6'^^\ the quantity pr(Lj\9^'^\ Mi, D) is given by 
a regression model with a particular UIP and is therefore well approximated by the BIC. In addition we 
can approximate pr{Lj\Mi, D) with pr{Lj\6''^\ D) and calculate BIC relative to this The conditional 
independence assumptions of {Mi,Lj) enable us to set II^*'^) = 0. Thus in our BIC approximation the 
likelihood pr((3\Lj,9^'^\ D) is maximized at /3(*'^) defined in Equation (|7|. Note that BIC can be calculated 



from the associated of the regression. By calculating the model probability j conditional on the fitted 
value oiW, we note that the resulting is equivalent to using the "Generalized i?^" suggested by 



Pesaran and Smith (1994) for scoring the second stage model Lj. 
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Using RST's own data set, we reexamine their robustness specifications using 2SBMA to 
address the model and instrument uncertainty that is highhghted so forcefully in their paper. 
A description of each regressor is provided in Table [2] and the theories that are associated 
with each regressor are discussed extensively in RST. The 2SBMA first and second stages 
are reported in Tables [3] and |4| respectively. 

In terms of development determinants, RST assume that Geography is exogenous, so 
the upper panel in Table |3] represents the first stage for the endogenous institutions proxy 
(Rule of Law) and the lower panel in the same table is the first stage for the endogenous 
Integration variable. 

Given the 2SBMA methodology, it would be enough to present only the 2SBMA results 
that explore the entire model space spanned by RST's development determinants and the 
associated instruments. This specification is provided in Column 3. We first provide, how- 
ever, two intermediate stages, where Column 1 represents RST's "core specification" (RST's 
Table 2) and Column 2 is the first set of RST's robustness exercises. The three-step approach 
highlights the sensitivity of the core specification to the introduction of additional covariates 
that are associated with different theories (in Column 2), as well as sensitivity to different 
variables associated with the different theories (in Column 3). 

Column 1 in Table |4] provides the second stage of RST's preferred core specification 
(RST's Table 2). Both RST and 2SBMA find that only Rule of Law shows an effect and the 
conditional posterior mean is nearly identical to RST's 2SLS estimate. In this specification, 
the 2SBMA result confirms RST's central finding that "the preferred specification accounts 
for about half of the variance in incomes across the sample, with institutional quality (in- 
strumented by settler mortality) doing most of the work." The generalized for the best 
2SBMA model is 0.53 versus 0.55 in RST's 2SLS approach. 

Column 1 in Table [3] reports the two 2SBMA first stages for RST's core specification. 
They broadly confirm the 2SLS results in RST although 2SBMA suggests slightly more 
parsimonious models. 2SBMA suggests three strong instruments for Rule of Law (Settler 
Mortality, Latitude, and the Fraction Speaking English), while RST found significant coef- 
ficients for all five instruments across their various 2SLS exercises. This generates a slightly 
higher for RST's preferred 2SLS specification (0.55) as compared to the best model in 
2SBMA (0.49). For Integration, the 2SBMA first stage suggests only two strong instruments 
(Implied Trade Shares and Settler Mortality) while 2SLS produces a statistically significant 
coefficient for an additional instrument (Fraction Speaking English). Nevertheless the R^ of 
the 2SBMA best model and of the 2SLS first stage are identical (0.58). 

RST find that core specifications with more than one instrument fail to pass the Sargan 
test. This finding is confirmed by the Bayesian Sargan test in Column 1 of Table [4], which 
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presents a similar p-value to that found by RST. One interpretation is that the Sargan test 
undermines alternative determinant and instrument strategies as suggested by RST. One 
could also argue, however, that RST's specification does not contain the appropriate set 
of instruments. We examine this issue further below but note that already at this stage, 
under-identification (as measured by the Bayesian Cragg-Donald p-value) is easily rejected 
by 2SBMA (not reported in RST). Weak instruments are not of concern in this data set. 

The 2SLS results in RST's core specification and the 2SBMA results in Column 1 are 
nearly identical because the core specification includes minimal model uncertainty at the 
determinant level and only a fraction of the standard instruments suggested by the devel- 
opment literature. Column 2 adds regressors suggested by alternative theories that pertain 
to Legal Origins and Religion, as well as regional dummies, while Column 3 represents the 
most comprehensive set of regressors that adds standard covariates related to alternative 
Geography theories (most notably Temperature, Malaria) as well as alternative Integration 
measures (such as Sea Access). As we allow for additional theories and the associated regres- 
sors, the 2SBMA results start to diverge from the results in RST's individual 2SLS robustness 
regressions. In other words, the disparities across results become more pronounced as model 
uncertainty increases. 

The 2SBMA results that use the most comprehensive set of instruments and development 
determinants (Column 3 in Tables [3] and |4]) , cast doubt on the primacy of institutions result. 
Instead 2SBMA finds that the "horse race" ends in a statistical three-way tie. Geography (as 
measured by Tropics), Institutions and Integration are shown to be highly effective develop- 
ment determinants. This result is particularly surprising in light of the fact that Geography 
is only occasionally weakly significant in RST, and Integration is never significant and often 
of the wrong sign. In 2SBMA all three effects are strong and estimated with the correct sign. 
Once model uncertainty is comprehensively addressed at both the development determinant, 
as well as the instrument level, the results thus support the contentions of Sachs (2003) and 



Alcala and Ciccone (2004), who report strong effects of Geography and Integration. 

The divergence of 2SLS and 2SBMA results can already be observed in the first stagesj^ 
Most importantly, the Implied Trade Share no longer receives support as a strong instru- 
ment for Integration. It is most strongly instrumented by EuroFrac in combination with the 
covariates PopGrowth, Oil, SeaAccess, Malaria94, EuroFrac, Tropics, Latitude, FrostArea, 
and PolicyOpenness. In contrast to the findings of RST, religion variables also play an im- 
portant part in the first stage regression. In particular. Catholic has nearly a 90 percent 
inclusion probability in the first stage for Rule of Law and above 50 percent in the first stage 
for Integration. Similarly, the power of Settler Mortality as an instrument for Institutions is 



^RST report neither first stages nor tests of instrument restrictions beyond the core specification. 
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Table 2: Variable Descriptions from RST dataset. 



Variable Name 


Description 




Area 


Land area (thousands sq. mt.) 




Catholic 


Dummy variable taking value 1 if the countrys population is predominantly catholic 




East Asia 


Dummy variable taking value 1 if a country belongs to South-East Asia, otherwise 




EngFrac 


Fraction of the population speaking English. 




FR_tradcShares 


Natural logarithm of predicted trade shares computed from a bilateral trade equation with pure geography variables. 


FrostArea 


Proportion of land with ^5 frost-days per month in winter. 




Frost Days 


Average number of frost-days per month in winter. 




Integration 


Natural logarithm of openness. Openness is given by the ratio of (nominal) imports plus 


exports to GDP (in nominal US dollars). 


Latin America 


Dummy variable taking value 1 if a country belongs to Latin America or the Caribbean, 


otherwise 


Latitude 


Distance from Equator of capital city measured as abs(Latitude)/90 




LegalOrigFr 


variable taking a value of 1 if a country has a legal system deriving from that in France 




LcgalOrigSocialist 


variable taking a value of 1 if a country has a socialist legal system 




Malaria94 


Malaria index, year 1994. 




MeanTemp 


Average temperature (Celsius). 




Muslim 


Dummy variable taking value 1 if the countrys population is predominantly muslim 




Oil 


variable taking value 1 for a country being major oil exporter, otherwise. 




Policy Openness 


Dummy variable that indicates if a country has sufficiently market oriented policies 




PopGrowth 


population growth 




Protestant 


variable taking value 1 if the countrys population is predominantly protestant 




RuleofLaw 


Rule of Law index. Refers to 2001 and approximates for 1990s institutions 




SeaAccess 


Dummy variable taking value 1 for countries without access to the sea, otherwise. 




SettlerMortality 


Natural logarithm of estimated European settlers mortality rate 




SubSahar a Africa 


taking value 1 if a country belongs to Sub-Saharan Africa, otherwise 




Tropics 


Percentage of tropical land area. 





dominated by regressors such as EuroFrac and Temperature variables in both first stages. 

Note that the increase in the model space of development determinants and instruments 
going from the specification in Column 1 to Column 3 dramatically improves the fit of the 
2SBMA first stage. For Column 3, the best models in both 2SBMA first stages report an 

that is at least 40 percent greater than those found in the core specification. 

A similar improvement in model fit can be observed in the second stage when considering 
the generalized (Pesaran and Smith, 1994) of the best model returned by 2SBMA. In fact, 
none of the top 100 models' generalized R^ falls below .82, which greatly exceeds any model 
presented by RST (whose highest generalized R^ is .73). 2SBMA has therefore uncovered 
combinations of instruments and development determinants that fit the data substantially 
better. This is then the source of the difference in the 2SLS and 2SBMA results in both 
the first and second stages. The Bayesian Sargan and Bayesian Cragg and Donald tests 
clearly show, respectively, that over-identification is easily rejected with the improved set of 
instruments and that under-identification remains of little concern. 
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Table 3: First Stage Results for RST Example 







I 






II 






III 








RST Table 2 






RST Table 2, 


4 


RST Table 2, 4, 


5 6 




Core Specification 


I + LegalOrig, Rclig 


Region 


II + Alt. 


Int e^r. / Gleo 


Measures 








Stage 1, Dependent Variab 


le: Rule of Law 








p 7^ 


Mean 


Sd 




Mean 


Sd 




Mean 


Sd 


bcttlcrJX'Iortality 


92.7 


-0.21189 


0.07113 


25.5 


-0.1U696 


0.07218 


17.1 


-0.02528 


0.06317 


EuroFrac 


16.5 


0.23 


0.23093 


99.9 


1.66071 


0.31244 


100 


1.03 


U. J994 


Catholic 








14.9 


-0.00478 


0.00381 


89.9 


-U.Ui4Uo 


A AACr^/3 

O.OOOYD 


MeanTemp 














86.8 


-0.05535 


U. 0^794 


PopGrowth 














72.5 


-0.1011 


0.08351 


SubSaharaAfrica 








10.7 


-0.34126 


0.30796 


55.5 


-0.2287 


0.2442 


Muslim 








13 


-0.00434 


0.00393 


40.2 


A AAOH/I 


0.003 


Latitude 


89.7 


0.02254 


0.00792 


99.1 


0.02919 


0.00732 


20.5 


U.0U41± 


0.00938 


Latin America 








99.9 


-1.00161 


0.27639 


14.9 


-0.1277 


0.3428 


Area 














12.8 


.02582 


.07443 


Oil 














8.8 


A A "7 Q 


0. 1377 


i'Jrt-lraac bnarcs 


38.5 


0.17551 


0.09726 


99 


0.28821 


0.08545 


8 


-u.uxy lo 


A A O 1 C O 


Tropics 














7.9 


A AOQnO 


0. 1062 


EngFrac 


98.6 


1.07778 


0.28172 


12.2 


0.37306 


0.36301 


7.5 


n ^\Af^^ i 
U.LXtDl / 




FrostArea 














6.3 


U.U4Dyo 


0.2127 


Protestant 








10.7 


-0.00702 


0.00619 


3.8 


U.UUU Jo 


A AA1 CA 


Frost Days 














1.9 


A AAA/1 Q 

U.UUU40 


A AA^oe 
U.OO * 2.0 


LcgalOrigFr 








46.7 


-0.30591 


0.16008 


1.8 


A AAQ O '7 


A A O Cr" O O 


ScaAcccss 














1.4 


0.00211 


U.Uz i\jO 


Policy Openness 














1.1 


U.UU^DO 


A A O O O 










91.4 


0.72988 


0.26051 











Malaria94 























LcgalOrigSocialist 








63.9 


-0.77728 


0.36386 


na 


na 


na 


BIG best model 


-41.53 






-53.81 






-51.42 






R2 best model 


0.49 






0.66 






0.75 












Stage 1, Dependent Variable: Integration 










Mean 


Sd 


PitO 


Mean 


Sd 




Mean 


Sd 


FR_Tradc Shares 


100 


0.5985 


0.0612 


100 


0.5769 


0.0502 


0.7 


-0.086 


0.1074 


LcgalOrigSocialist 








20.1 


-0.2561 


0.2001 


na 


na 


na 


PopGrowth 














100 


-0.2735 


0.0285 


SeaAccess 














94.8 


-0.3023 


0.106 


Oil 














94.4 


0.3445 


0.1284 


Malaria94 














91.8 


-0.4383 


0.1399 


EuroFrac 


14.4 


-0.1053 


0.1389 


6.1 


0.0563 


0.1329 


81.2 


-0.5145 


0.1826 


Tropics 














73.1 


0.4392 


0.1921 


Latitude 


23.5 


-0.0065 


0.005 


4.9 


0.0003 


0.0039 


72.7 


-0.0164 


0.007 


FrostArea 














65.3 


0.497 


0.2019 


PolicyOpenncss 














59.1 


0.3468 


0.1391 


Catholic 








7.3 


0.001 


0.0014 


52.5 


-0.0036 


0.0018 


SettlerMortality 


84.9 


-0.1111 


0.0408 


9 


-0.0349 


0.0371 


50.2 


-0.1077 


0.0579 


EastAsia 








100 


0.8236 


0.139 


28.2 


0.2917 


0.1663 


EngFrac 


23.2 


0.246 


0.1865 


83.6 


0.382 


0.1431 


24.7 


-0.6486 


0.3051 


Frost Days 














17.1 


0.0217 


0.0125 


Latin America 








6.3 


-0.0448 


0.1147 


16.6 


-0.4149 


0.1899 


MeanTemp 














13.5 


-0.0225 


0.0118 


SubSaharaAfrica 








5.3 


-0.033 


0.0925 


4 


-0.1922 


0.1627 


LcgalOrigFr 








6 


0.0464 


0.1039 


3.8 


-0.095 


0.0902 


Protestant 








11.9 


0.0041 


0.0035 


0.7 


0.0024 


0.0029 


Muslim 








6.3 


-0.0006 


0.0012 


0.2 


-0.0007 


0.0015 


Area 














0.2 








BIC best model 


-61.22 






-84.37 






-64.57 






R2 best model 


0.58 






0.71 






0.81 
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Table 4: Second Stage Results for RST Example 







I 






II 






III 








RST Table 


2 


RST Table 


2, 4 


RST Table 2, 4, 


5. 6 




Core Specification 


I -f LegalOrig, Relig, Region 


II + Alt. 


Intcgr./ Geo 


Measures 








Stage 2, Dependent 


Variable: 


log GDP per 


capita in 1995 






p / 


Mean 


Sd 


P 7^ 


Mean 


Sd 


P 


Mean 


Sd 


Rule of Law 


100 


1.2775 


0.1772 


100 


0.9485 


0.1323 


96.4 


0.7979 


0.3155 


Integration 


20 


0.1119 


0.2578 


7.4 


0.0697 


0.1451 


84.7 


0.9275 


0.3803 


Tropics 














69 


-0.7828 


0.37 


Area 














57.1 


.164 


.171 


SubSahar a Africa 








97 


-0.7487 


0.1998 


50.7 


-0.5319 


0.3077 


Catholic 








36.2 


0.0043 


0.0028 


50.6 


0.01 


0.0072 


Policy Openness 














49.4 


0.6857 


0.368 


PopGrowth 
















2099 


0. 1473 


Muslim 








50.3 


-0.0044 


0.0025 


43.8 


-0.0043 


0.0035 


Latin America 








10.1 


0.0984 


0.2858 


36.1 


0.6529 


0.3652 


LcgalOrigFr 








29.5 


0.2083 


0.2065 


34.6 


0.29 


0.1682 


Frost Area 














33.3 


1.2204 




Frost Day 














31.3 


-0.0621 


0.0383 


MeanTemp 














22.2 


0.0323 


0.0433 


East Asia 








22.8 


0.3345 


0.3127 


19.5 


0.532 


0.3898 


Latitude 


18.3 


-0.0019 


0.0143 


10.8 


-0.0058 


0.0099 


18.6 


-0.0168 


0.0162 


Oil 














18 


0.323 


0.2919 


Malaria94 














7.3 


-0.243 


0.4787 


SeaAccess 














5.6 


-0.0698 


0.3142 


Protestant 








8 


-0.0027 


0.006 


1.9 


-0.0016 


0.0069 


LcgalOrigSocialist 








41 


-0.6144 


0.4917 


na 


na 


na 


BIG best model 


-57.34 






-92.34 






-77.12 






Generalized R2 best model 


0.53 






75.10 






85.70 






Bayes/Sargan p-value 


0.0308 






0.7581 






0.8438 






Bayes/Cragg-Donald p-valuc 


0.0000 






0.0000 






0.0097 







6 Conclusion 

We have developed a methodology to address model uncertainty in the presence of endogene- 
ity and explored its properties as a valid IV estimator. The method is based on Bayesian 
Model Averaging (BMA), which has already been extensively used in economic applica- 
tions, particularly in modeling economic growth and development. 2SBMA is shown to 
be a consistent methodology that merges the BMA and 2SLS procedures in a natural and 
straightforward manner. 

In order to extend the 2SLS paradigm to a Bayesian context, we introduced a Two-Stage 
UIP, which built on the development of the Bayesian two-stage prior of Kleibergen and Zivot 



(2003). This allowed us to leverage existing widely-used software for regression variable un- 



certainty (Raftery et al., 2005). 

A key assumption in our development was that the errors were normally distributed. 
Such an assumption is not necessary in order for standard 2SLS to function properly as a 
consistent estimator, but it was important in specifying our two-stage UIP and is similar 



to most Bayesian developments of the endogenous variable model (Dreze, 1976 Kleibergen 



and Zivot, 2003). Relaxing the normality assumption therefore constitutes an important 
further step in the development of this methodology. One potential avenue for performing 
this relaxation is to consider extending the Dirichlet process approach employed by Conley 



et al. (2008). 



Instrumental variable estimation of any kind requires a number of assumptions that re- 
late to the identification of the implied structural model. To examine the validity of these 
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assumptions in the context of 2SBMA, we propose Bayesian Sargan and Bayesian Cragg 
and Donald tests of the over- and under-identification restrictions. These tests are based on 
model averaged posterior p- values within the 2SBMA framework. We show that the power of 
these tests is substantially greater than those of standard 2SLS tests using the full specifica- 
tion. In addition, within the context of 2SBMA these tests are less affected by an increase in 
the number of potential instruments. We have focused on deriving these particular tests in 
part because they are well-known to the majority of applied practitioners. Furthermore, we 
have shown that our implementation of these tests, in particular the Bayesian Sargan test, 
performs in a satisfactory manner. Alternative frameworks of verifying model assumptions 
based more directly on the comparison of Bayes Factors, or tests that deal with more compli- 
cated sampling assumptions, for instance heteroskedasticity could have also been proposed. 
It will be an interesting and important development to see what additional insight such tests 
could provide. 

We apply 2SBMA to a prominent development accounting approach (Rodrik et al. 
(2004)), which was itself motivated by the vast model uncertainty associated with alterna- 
tive theories of development and alternative instruments to control for potential endogeneity. 
Instead of resolving the model uncertainty in a horse race of alternative regressions, we use 
the formal 2SBMA approach. We find not only support for institutions, but also substantial 
support for geographic and trade factors, once model uncertainty in the presence of endo- 
geneity is addressed. The latter two effects had been relegated to second order effects by 
RST. 

The purpose of our paper is to introduce the statistical foundations of 2SBMA method- 
ology and provide applications that highlight the importance of instrument and covariate 
uncertainty in economics. In the context of the RST example, alternative approaches to de- 



velopment accounting include Mauro (1995), who first suggested ethnolinguistic fragmenta- 
tion as a fundamental determinant of corruption, and Hall and Jones (1999), who introduced 
Latitude and Language indicators as instruments to measure Western influence. Acemoglu 



et al. (2001b 2001a) suggested population density in 1500 and the type of colonial origin 
(indicated by settler mortality) as effective instruments, respectively. La Porta et al. (2004) 
presented yet another "horse race" of theories, in their case juxtaposing judicial independence 
and constitutional review. In RST the "horse race" is between three possible determinants: 
Institutions, Integration, and Geography. Geography-based theories of fundamental devel- 



opment determinants have previously been proposed by Bloom and Sachs (1998), Easterly 
and Levine ( [2003| ), and Sachs ( |2003| ). 

Several modifications of the BMA paradigm have been proposed in the development de- 



terminant literature. Brock et al. (2003) and Durlauf et al. (2008) discuss priors on the 
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model space that account for the fact that many variables may be collected to proxy one 
particular theory, while fewer may be available to proxy an alternative theory. Ley and 



Steel (2007) and Doppelhofer and Weeks (2009) develop metrics to quantify the degree to 
which development determinants act "jointly" to affect growth. Determining how these ex- 
tensions of the BMA paradigm may be taken into account in the 2SBMA framework would 
help extend the application of 2SBMA to the particular problem of testing growth theory 
robustness. 

The 2SBMA method allows researchers to incorporate concepts of model uncertainty and 
model averaging into the assessment of a diverse range of economic behavior where observa- 
tions are subject to endogeneity. However, the current framework does not directly handle 
such concepts as panel data, mixed effects, random coefficient models, and heteroskedastic- 
ity. Future research into these areas will improve the applicability of the BMA framework 
to economic analysis, in growth economics and beyond. 
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Appendix 



Proof of Theorem^ In what follows, for a matrix A, let = A{A'A)^'^A' and = 
/ — Pa- The likelihood equations follow directly from the marginal distribution of e and the 
conditional distribution of ^ given e. Furthermore, the fact that 9 maximizes Equation [s] 
follows directly from standard results for the MLE of a regression equation. 

Now, consider /3. Note that if U is in the column space of V, which occurs in this case, 
then U = PyU. By moving to log likelihoods, and differentiating with respect to the vector 
/3 we see that 

/3 = {U'uy^U'Y + <f){U'U)-^U'e (A-1) 

but note that 

U'e = U'PvMvV = 
Since PyMy = 0, thus Equation \aa\ becomes /3 = P"^^^^ . □ 



Proof of Theorem This follows directly from the proof of Theorem [T} However, we 
note that it is possible that U'^^'' may not be written as PyHjU^^^ for arbitrary Mj and Lj, 
which causes the additional 0(*'-')n'^*'-'^ to be present. □ 

Proof of Theorem^ Note that using the standard BMA results, the variance of ^^sbma 
can be written as 



i=l j=l 1=1 j=l 

Rewriting this we have. 



(A-2) 



j2SBMA\2 



i=i L j=i 

' f ^ r 

= E E ^'^^ Var{P^'^^^) + - P^'''^ + /J^*'') - /3' 

i=i I i=i 

i=i I i=i 



which results since. 



2SBMA\ 



0. 



(A-3) 
(A-4) 
(A-5) 
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Reordering the terms we then obtain 

1=1 1=1 

as desired. □ 

Proof of Theorem ^ For convenience, suppose that Mi G is the true model for the 
first stage. By true model, we do not mean the underlying model that gave rise to W, but 
the correct subset of V for which the associated elements in 6 are not zero. Then, 

TTi — )-p 1 and 71 j — )-p 0, j 7^ 1 as n — )■ cxo. 

by the consistency of the model selection procedure. Furthermore, suppose that Li G £ is 
the true second stage model. Then, 

^1,1 -^p 1 and ui j -^p 0, j 7^ 1 as n — )■ oo. 

Therefore, 

p2SBMA^^ ^(1,1) _ 

Finally consider ^^^^^ . We know that f3'^^^^ -^^ f3 by the consistency of the technique. 
Furthermore, since the first and second stage estimates of 2SLS are individually consistent 
we have ^^^^^ — j.^ provided Mi and Li are the true first and second stage models. 

Thus, (3^^'^^ — T-p 13, showing the technique is consistent. □ 
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